NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A study of natural robustness of deep reinforcement learning algorithms towards adversarial perturbations

https://doi.org/10.1016/j.aiopen.2024.08.005

Liu, Qisai; Lee, Xian Yeow; Sarkar, Soumik (January 2024, AI Open)

Full Text Available
Multi-fidelity machine learning models for structure–property mapping of organic electronics

https://doi.org/10.1016/j.commatsci.2022.111599

Yang, Chih-Hsuan; Pokuri, Balaji Sesha; Lee, Xian Yeow; Balakrishnan, Sangeeth; Hegde, Chinmay; Sarkar, Soumik; Ganapathysubramanian, Baskar (October 2022, Computational Materials Science)

Full Text Available
MDPGT: Momentum-Based Decentralized Policy Gradient Tracking

https://doi.org/10.1609/aaai.v36i9.21169

Jiang, Zhanhong; Lee, Xian Yeow; Tan, Sin Yong; Tan, Kai Liang; Balu, Aditya; Lee, Young M; Hegde, Chinmay; Sarkar, Soumik (June 2022, Proceedings of the AAAI Conference on Artificial Intelligence)

We propose a novel policy gradient method for multi-agent reinforcement learning, which leverages two different variance-reduction techniques and does not require large batches over iterations. Specifically, we propose a momentum-based decentralized policy gradient tracking (MDPGT) where a new momentum-based variance reduction technique is used to approximate the local policy gradient surrogate with importance sampling, and an intermediate parameter is adopted to track two consecutive policy gradient surrogates. MDPGT provably achieves the best available sample complexity of O(N -1 e -3) for converging to an e-stationary point of the global average of N local performance functions (possibly nonconcave). This outperforms the state-of-the-art sample complexity in decentralized model-free reinforcement learning and when initialized with a single trajectory, the sample complexity matches those obtained by the existing decentralized policy gradient methods. We further validate the theoretical claim for the Gaussian policy function. When the required error tolerance e is small enough, MDPGT leads to a linear speed up, which has been previously established in decentralized stochastic optimization, but not for reinforcement learning. Lastly, we provide empirical results on a multi-agent reinforcement learning benchmark environment to support our theoretical findings.
more » « less
Full Text Available
Multi-resolution 3D CNN for learning multi-scale spatial features in CAD models

https://doi.org/10.1016/j.cagd.2021.102038

Ghadai, Sambit; Lee, Xian Yeow; Balu, Aditya; Sarkar, Soumik; Krishnamurthy, Adarsh (November 2021, Computer Aided Geometric Design)

Full Text Available
Query-based targeted action-space adversarial policies on deep reinforcement learning agents

https://doi.org/10.1145/3450267.3450537

Lee, Xian Yeow; Esfandiari, Yasaman; Tan, Kai Liang; Sarkar, Soumik (May 2021, ICCPS '21: Proceedings of the ACM/IEEE 12th International Conference on Cyber-Physical Systems)
Spatiotemporally Constrained Action Space Attacks on Deep Reinforcement Learning Agents

https://doi.org/10.1609/aaai.v34i04.5887

Lee, Xian Yeow; Ghadai, Sambit; Tan, Kai Liang; Hegde, Chinmay; Sarkar, Soumik (June 2020, Proceedings of the AAAI Conference on Artificial Intelligence)
null (Ed.)
Robustness of Deep Reinforcement Learning (DRL) algorithms towards adversarial attacks in real world applications such as those deployed in cyber-physical systems (CPS) are of increasing concern. Numerous studies have investigated the mechanisms of attacks on the RL agent's state space. Nonetheless, attacks on the RL agent's action space (corresponding to actuators in engineering systems) are equally perverse, but such attacks are relatively less studied in the ML literature. In this work, we first frame the problem as an optimization problem of minimizing the cumulative reward of an RL agent with decoupled constraints as the budget of attack. We propose the white-box Myopic Action Space (MAS) attack algorithm that distributes the attacks across the action space dimensions. Next, we reformulate the optimization problem above with the same objective function, but with a temporally coupled constraint on the attack budget to take into account the approximated dynamics of the agent. This leads to the white-box Look-ahead Action Space (LAS) attack algorithm that distributes the attacks across the action and temporal dimensions. Our results showed that using the same amount of resources, the LAS attack deteriorates the agent's performance significantly more than the MAS attack. This reveals the possibility that with limited resource, an adversary can utilize the agent's dynamics to malevolently craft attacks that causes the agent to fail. Additionally, we leverage these attack strategies as a possible tool to gain insights on the potential vulnerabilities of DRL agents.
more » « less
Full Text Available
Fast inverse design of microstructures via generative invariance networks

https://doi.org/10.1038/s43588-021-00045-8

Lee, Xian Yeow; Waite, Joshua R.; Yang, Chih-Hsuan; Pokuri, Balaji Sesha; Joshi, Ameya; Balu, Aditya; Hegde, Chinmay; Ganapathysubramanian, Baskar; Sarkar, Soumik (March 2021, Nature Computational Science)

Full Text Available

Search for: All records